Corpus: arz_wikipedia_2014_10K


2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character n-grams for N = 1 to 5, together with a Zipf diagram of their frequencies.
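Counts of this kind can be approximated with a few lines of Python. The sketch below is only an illustration, not the tool that produced this page: the function name `top_word_beginnings`, the toy token list, and the choice to skip words shorter than N are all assumptions, and characters are counted as Unicode code points.

```python
from collections import Counter


def top_word_beginnings(words, n, k=5):
    """Return the k most frequent length-n word prefixes as (prefix, count) pairs.

    Assumption: words shorter than n characters are skipped rather than padded.
    """
    counts = Counter(w[:n] for w in words if len(w) >= n)
    return counts.most_common(k)


# Hypothetical toy token list, not taken from the corpus.
tokens = ["المدرسة", "المدينة", "الكتاب", "بيت", "بيروت", "محمد"]

for n in range(1, 6):
    print(n, top_word_beginnings(tokens, n, k=2))
```

A real run would first need a tokenizer for the corpus text; how the original statistics split words is not stated here.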


Figure: Zipf's diagram for word beginnings (Gnuplot diagram)
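A Zipf diagram plots frequency against rank on logarithmic axes; under Zipf's law the points fall roughly on a straight line with negative slope. The following is a minimal sketch of how the plotted points and a least-squares slope could be derived. The helper names `zipf_points` and `loglog_slope` are hypothetical, and the input is just the five bigram frequencies listed in this section, far too few points for a meaningful fit.

```python
import math


def zipf_points(frequencies):
    """Sort frequencies in descending order and pair each with its rank (1-based)."""
    ordered = sorted(frequencies, reverse=True)
    return [(rank, freq) for rank, freq in enumerate(ordered, start=1)]


def loglog_slope(points):
    """Least-squares slope of log(frequency) against log(rank).

    Assumes at least two points and strictly positive frequencies.
    """
    xs = [math.log(rank) for rank, _ in points]
    ys = [math.log(freq) for _, freq in points]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    return cov / var


# Frequencies of the top five word-initial character bigrams from this page.
bigram_freqs = [10152, 1334, 1000, 934, 898]
points = zipf_points(bigram_freqs)
print(points)
print(loglog_slope(points))
```

A slope near -1 would correspond to the classic Zipf distribution; the full frequency list, not just the top five entries, would be needed to judge that.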

Top Characters
rank frequency n-gram
1 13421 ا-
2 4812 ب-
3 3536 م-
4 2725 و-
5 2681 ل-
Top Character Bigrams
rank frequency n-gram
1 10152 ال-
2 1334 بي-
3 1000 با-
4 934 لل-
5 898 وا-
Top Character Trigrams
rank frequency n-gram
1 1753 الم-
2 988 الا-
3 735 بال-
4 612 الت-
5 586 وال-
Top Character 4-Grams
rank frequency n-gram
1 160 المت-
2 157 الاس-
3 137 المس-
4 127 المع-
5 122 المن-
Top Character 5-Grams
rank frequency n-gram
1 71 محمد-
2 64 الاست-
3 51 جامعة-
4 46 المست-
5 35 المعا-